Improving the Prediction Accuracy of Liver Disorder Disease with Oversampling

Author

  • HYONTAI SUG
Abstract

The complexity of the liver makes it susceptible to disease and disorder. Diagnosing liver disorders is therefore of high interest to data miners. Decision trees have been useful data mining tools for this diagnosis, but their accuracy has been limited by insufficient data. To generate more accurate decision trees for liver disorders, this paper proposes a method based on over-sampling the minor classes to compensate for the insufficiency of data. Experiments with two representative decision tree algorithms, C4.5 and CART, on the ‘BUPA liver disorder’ data set showed the validity of the method.

Key-Words: biased sampling, liver disorder disease, classification.
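
The abstract does not spell out the over-sampling procedure, so the following is only a minimal sketch of one common variant, random over-sampling by duplication, and not necessarily the author's method. The function name `oversample_minority` and the toy rows are hypothetical; the data shown is made up and is not the BUPA data set.

```python
import random
from collections import Counter


def oversample_minority(rows, labels, seed=0):
    """Randomly duplicate samples of the smaller classes until every
    class has as many samples as the largest class."""
    rng = random.Random(seed)
    counts = Counter(labels)
    target = max(counts.values())
    out_rows, out_labels = list(rows), list(labels)
    for cls, n in counts.items():
        # All original samples that belong to this class.
        pool = [r for r, y in zip(rows, labels) if y == cls]
        # Draw with replacement to fill the gap up to the target size.
        extra = [rng.choice(pool) for _ in range(target - n)]
        out_rows.extend(extra)
        out_labels.extend([cls] * len(extra))
    return out_rows, out_labels


# Toy data for illustration only (assumed values, two numeric features).
rows = [[85, 92], [88, 64], [90, 70], [87, 54], [91, 33]]
labels = [1, 1, 1, 2, 2]

bal_rows, bal_labels = oversample_minority(rows, labels)
print(Counter(bal_labels))  # both classes now have the same count
```

In practice the balanced rows would then be passed to a decision tree learner such as C4.5 or CART. Over-sampling is normally applied to the training split only, so that duplicated samples do not leak into the test set.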

Similar resources

Improving the Performance of Machine Learning Algorithms for Heart Disease Diagnosis by Optimizing Data and Features

The heart is one of the most important organs of the body, and heart disease is the major cause of death in the world and in Iran. Early, timely diagnosis is therefore one of the most important foundations for preventing and reducing deaths from this disease. Many studies have addressed heart disease with the aim of prediction, diagnosis, and treatment. However, most of them have been mostly f...

Diabetes Prediction by Optimizing the Nearest Neighbor Algorithm Using Genetic Algorithm

Introduction: Diabetes, or diabetes mellitus, is a metabolic disorder in which the body does not produce insulin, or the insulin produced cannot function normally. The variety of signs and symptoms of this disease makes it difficult for doctors to diagnose. Data mining allows analysis of patients’ clinical data for medical decision making. The aim of this study was to provide a model fo...

Prediction of Liver Diseases Using a Hidden Markov Model

Background: The liver is the largest internal organ and, after the heart and brain, the most important organ in the human body; life is impossible without it. Diagnosing liver disease takes a long time and requires sufficient expertise from the doctor. Statistical methods can serve as an automated forecasting system and help specialists diagnose liver disease quickly and accurately. Hidden Ma...

Automatic classification of Non-alcoholic fatty liver using texture features from ultrasound images

Background: Accurate and early detection of non-alcoholic fatty liver, a major cause of chronic disease, is vital for preventing the complications associated with it. Ultrasound of the liver is the most common and widely performed method of diagnosing fatty liver. However, due to the low quality of ultrasound images, the need for an automatic and intelligent...

Publication date: 2012